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1.  Introduction 

Statisticians  use  the  Internet  and  the  World  Wide 
Web  (WWW)  to  exchange  e-mail  and  to  view,  re¬ 
trieve  and  distribute  research  papers,  data  sets,  com¬ 
puter  source  code  and  other  text  materials.  In  ad¬ 
dition  to  the  text,  data  sets,  and  graphs  that  are  in 
paper  reports,  electronic  papers  can  include  sound, 
animation,  video  and  other  computer-based  media 
products  that  can  improve  the  presentation  of  the 
results.  Recently  it  has  become  possible  to  embed 
executable  content  —  programs  —  into  electronic 
documents.  A  statistician  can  now  write  computer 
programs  to  perform  calculations  described  in  the 
paper,  embed  the  code  in  the  paper,  and  distribute 
the  paper  over  the  Internet.  Any  reader  of  the  paper 
can  execute  the  code  on  his  or  her  computer  and  can 
interact  with  the  executing  program.  This  capabil¬ 
ity  transforms  what  was  once  a  static  presentation 
of  research  results  into  a  presentation  that  can  be 
dynamic  and  interactive.  This  paper  describes  how 
the  new  capability  can  change  the  way  statistical  re¬ 
search  is  conducted.  An  illustration  is  given  from 
the  field  of  vision  research.  In  this  real-life  problem, 
researchers  need  to  find  the  best-fitting  ellipses  for 
particular  sets  of  data  that  they  gather.  Our  soft¬ 
ware  allows  any  interested  researcher  with  a  browser 
to  compute  the  relevant  parameters,  get  estimates  of 
their  standard  errors,  and  draw  pictures  to  illustrate 
the  quality  of  the  fit. 

2*  Dynamic,  Interactive  Documents 

The  most  widely  used  technology  for  embedding 
executable  code  into  electronic  research  papers  is 
the  Java  programming  language.  A  compiled  Java 
program  can  be  attached  to  a  Web  page  and  then 
downloaded  over  the  Internet  and  executed  on  the 
user’s  machine  using  a  Java-capable  browser.  Most 
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browsers,  including  the  most  widely  used  ones,  are 
Java-capable.  Java  is  a  full-featured  object-oriented 
programming  language  that  was  designed  to  execute 
without  modification  on  a  variety  of  different  com¬ 
puters,  including  PC’s,  Apple  computers,  and  most 
Unix  machines.  Although  Java  is  not  the  only  tech¬ 
nology  to  be  embedded  in  Web  documents  and  dis¬ 
tributed  over  the  Internet,  it  is  the  most  widely  used 
and  widely  supported. 

Under  the  control  of  the  reader,  a  Java  program 
can  be  executed  on  the  reader’s  computer.  This 
“dynamic”  execution  can  produce  the  full  range  of 
computation  and  presentation  of  results  possible  on 
modern  windows-based  computers.  This  includes 
animation,  sound  and  full-color  display  of  results. 
In  addition,  the  reader  can  “interact”  with  the  run¬ 
ning  program  to  control  its  execution  and  to  supply 
data. 

2*1  Embedded  Algorithms 

The  “dynamic”  and  “interactive”  capabilities  are 
both  important  to  the  propagation  of  statistical  re¬ 
search,  but  it  is  the  second  that  is  perhaps  the  more 
vital.  It  is  common  practice  today  for  a  researcher  to 
produce  an  algorithm  or  technique,  to  try  it  out  on 
some  data,  and  to  publish  the  algorithm  as  part  of 
a  research  paper.  In  some  cases  the  reader  is  shown 
neither  the  full  code  that  implements  the  algorithm, 
nor  the  data  on  which  it  was  run.  In  others,  the  code 
is  available  only  in  a  form  that  many  readers  cannot 
use.  Even  in  the  most  benign  case,  it  generally  falls 
to  the  reader  to  “port”  the  code  to  his  or  her  envi¬ 
ronment,  and  to  validate  the  algorithm  with  his  or 
her  own  data. 

With  the  recent  advances  in  Internet  technology, 
an  algorithm  can  be  embedded  inside  a  research  pa¬ 
per.  This  allows  the  reader  not  only  to  see  the  results 
of  the  algorithm  as  it  applies  to  the  author’s  data, 
but,  given  access  to  the  data,  to  reproduce  those  re¬ 
sults.  This  allows  immediate  reproducibility  of  the 
author’s  computational  results. 

Even  more  importantly,  an  embedded  algorithm 
can  permit  the  reader  to  use  it  on  his  or  her  own 
data.  This,  after  all,  is  often  the  reader’s  motivation: 
to  use  the  newly-described  technique  to  solve  one  of 
his  or  her  problems.  By  applying  the  algorithm  to 


his  or  her  own  data,  the  reader  can  evaluate  the 
robustness  of  the  procedure  as  well.  All  of  this  is 
made  instantly  accessible  to  any  user  who  has  the 
use  of  a  Web  browser.  Readers  who  are  from  other 
disciplines  —  who  are  less  often  able  to  “port”  code 
and  who  are  frequently  in  position  to  benefit  most 
from  the  fruits  of  the  research  —  can  find  these  pre¬ 
written  algorithms  particularly  useful. 

It  is  certainly  applied  papers  that  can  benefit  most 
from  technologies  that  permit  dynamic  visualization 
and  from  the  ability  to  embed  algorithms.  There  will 
always  be  a  place  in  statistics  for  research  of  a  theo¬ 
retical  nature  that  requires  no  pictures  or  computer 
programs  to  show  its  results.  Still,  these  papers,  too, 
can  benefit  from  the  new  technologies.  In  addition 
to  “links”  to  whatever  data  might  be  used  in  exam¬ 
ples,  a  paper  might  maintain  links  to  reference  ma¬ 
terials,  other  papers,  and  statistical  encyclopedias, 
for  example,  for  definitions  and  examples  of  terms 
that  might  not  be  in  the  casual  reader’s  vocabulary. 
Every  researcher  has  faced,  in  reading  a  research  pa¬ 
per,  the  task  of  locating  reference  papers  that  cannot 
be  found  in  the  local  library;  we  envision  a  world  in 
which  most  of  those  papers  are  one  mouse-click  away. 
The  emerging  Internet  technologies  can,  and  should, 
change  the  way  that  statisticians  do  research. 


2.2  Java  and  Its  Rivals 

Of  course  none  of  these  advances  require  the  use 
of  the  Java  language  specifically.  However,  this  lan¬ 
guage  has  proven  itself  to  be  well-adapted  to  Internet 
concerns  through  its  built-in  network  awareness  and 
inherently  secure  design.  Some  drawbacks  remain 
in  these  areas.  Currently  (as  of  October  1997),  Web 
browsers  do  not  allow,  for  security  reasons,  embed¬ 
ded  Java  programs  to  read  or  write  to  files  on  the 
reader’s  machine.  This  is  not  inherent  to  the  Java 
language  nor  to  Java’s  security  model;  the  next  gen¬ 
eration  of  browsers  will  allow  a  reader  to  control  ac¬ 
cess  to  the  files  on  his  or  her  machine,  allowing  a  Java 
program  to  access  data  from,  and  write  results  to, 
particular  local  files  or  files  in  particular  directories. 
Furthermore,  the  language  has  been  slow.  Initially, 
Java  programs  were  one-tenth  to  one-twentieth  as 
fast  as  C,  C+4-,  or  FORTRAN  programs  perform¬ 
ing  the  same  calculations.  Recently,  new  compiler 
technology  has  reduced  this  disparity  and  has  the 
potential  for  making  Java  programs  as  fast  or  faster 
than  other  leinguages.  Development  time  in  Java  is 
generally  reduced  because  of  the  language’s  simple 
nature  as  embodied,  for  example,  in  the  absence  of 
pointer  arithmetic. 


2.3  The  Future  of  the  Research  Paper 

The  primary  medium  to  report  and  to  preserve  re¬ 
search  results  has  been  the  research  paper.  We  ex¬ 
pect  that  this  will  continue.  In  recent  years  there  has 
been  a  great  increase  in  “electronic  publication”;  it 
was  estimated  in  May  1995  that  there  were  some 
300  electronic  research  journals  (Economist  (1995)). 
The  obvious  advantage  of  an  electronic  publication  is 
the  speed  with  which  it  can  be  widely  disseminated. 
However,  part  of  the  reduction  in  distribution  time 
has  come  from  avoiding  the  refereeing  process  that 
is  so  critical  to  the  advancement  of  knowledge.  This 
avoidance  is  certainly  not  inherent  in  the  technol¬ 
ogy,  and  refereeing  procedures  can  and  have  been 
set  up  to  certify  the  quality  of  publications.  After 
the  publication  has  been  refereed,  electronic  distri¬ 
bution  is  instant.  Journal  publication  requires  addi¬ 
tional  effort  to  prepare  for  publication  (possibly  the 
creation  of  galley  proofs  and  additional  proofread¬ 
ing)  and  then  a  wait  in  a  publication  queue  that 
may  be  months  or  years  long.  However,  once  an  ar¬ 
ticle  has  appeared  in  a  journal  it  has  permanence  — 
it  will  always  be  available  —  and  it  has  stability  — 
it  will  be  the  same  to  all  readers. 

In  order  for  electronic  publication  to  be  accept¬ 
able  for  archival  publication  of  research  there  must 
be  procedures  to  protect  the  existence  and  integrity 
of  the  publication.  For  existence  the  author  and 
the  professional  community  must  be  assured  that 
the  publication  will  be  easily  and  cheaply  available 
to  scholars  for  at  least  hundreds  of  years.  A  wide¬ 
spread  computer  virus  could  produce  the  electronic 
equivalent  of  the  burning  of  the  library  of  Alexan¬ 
dria.  There  must  be  additional  procedures  that  as¬ 
sure  that  the  publication  continues  to  exist  in  its 
original  form;  there  can  be  no  undocumented  revi¬ 
sions. 

Assuming  that  the  primary  requirements  of  exis¬ 
tence  and  integrity  are  met,  we  can  consider  addi¬ 
tional  benefits.  The  publication  could  be  made  avail¬ 
able  at  very  low  cost  by  a  provider  who  would  supply 
only  electronic  access;  the  user  would  bear  any  cost 
of  transferring  it  to  paper.  In  1991  researchers  at 
Los  Alamos  National  Laboratory  set  up  a  service  to 
store  and  distribute  electronic  research  preprints.  It 
has  been  reported  (Economist  (1995))  that  they  han¬ 
dle  40,000  requests  per  day  from  20,000  researchers. 
Electronic  publications  can  provide  instant  access  to 
references  that  are  also  available  on  the  Internet. 
While  the  economics  of  electronic  publication  have 
yet  to  be  worked  out,  it  is  to  be  hoped  that  this  form 
of  distribution  will  be  substantially  less  expensive 
than  paper  journals,  and  therefore  that  these  publi- 
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cations  can  be  made  more  widely  available  than  the 
vast  and  expensive  set  of  journals  being  published 
today. 

Electronic  authors  could  shorten  or  eliminate  in¬ 
troductory  materials  by  linking  to,  rather  than  in¬ 
cluding,  them.  This  can  be  carried  one  step  further 
by  imagining  that  communities  of  researchers  will 
build  Internet  resources  for  each  research  field  that 
can  provide  common  materials:  for  example,  a  sim¬ 
ple  introduction  to  the  problems,  a  set  of  standard 
notation  and  definitions,  a  set  of  standard  test  prob¬ 
lems,  a  comprehensive  bibliography,  etc.  As  these 
resources  change  over  time  they  would  have  to  be 
dated  so  that  a  researcher  linking  to  a  publication 
written  five  years  ago  would  get  the  list  of  references 
that  existed  at  the  time  the  publication  was  writ¬ 
ten.  Electronic  publications  can  also  easily  attach 
corrections  and  additions  to  the  publication  (while 
preserving  the  integrity  of  the  original  publication). 
This  encourages  other  researchers  to  maintain  links 
to  the  publications  rather  than  saving  a  copy  that 
becomes  static  the  moment  it  is  copied. 

The  present  technology  typically  has  a  one-way 
link  from  a  publication  to  a  publication  that  it  refers 
to.  A  two-way  link  could  be  maintained  that  keeps 
with  each  publication  links  to  other  publications 
that  are  linked  to  it.  This  might  be  accomplished  by 
a  process  where  a  new  publication  “registers”  with 
some  or  all  of  the  existing  publications  it  links  to. 
This  makes  explicit  the  dynamic  web  of  publications 
that  makes  up  the  body  of  research  results  in  a  spe¬ 
cific  area. 

Links  to  other  documents  are  valuable  in  research 
papers;  however,  hypertext  does  not  have  as  much 
impact  on  research  publications  as  on  some  other 
kinds  of  publications.  In  contrast  to  publications 
that  present  an  interrelated  body  of  knowledge  that 
can  be  traversed  in  any  number  of  different  ways, 
reisearch  publications  are  essentially  a  linear  presen¬ 
tation  of  material  to  convince  the  user  about  some 
proposition.  They  are  more  like  a  legal  brief  than  a 
walk  through  a  museum  or  an  encyclopedia.  In  the 
jargon  of  hypertext,  research  publications  are  inher¬ 
ently  linear  and  links  are  primarily  used  to  connect 
to  materials  outside  the  publication  rather  than  al¬ 
lowing  the  user  to  read  the  publication  in  a  nonlinear 
fashion. 

3.  Example 

3.1  The  Chromatic  Oblique  Effect 

In  this  section  we  give  an  example  of  a  real-life  prob¬ 
lem  whose  solution  uses  these  technologies.  In  the 


vision  research  community,  it  is  well  known  that 
the  human  eye  exhibits  an  “oblique  effect.”  That 
is,  (monochromatic)  stimuli  are  perceived  differently 
depending  on  whether  they  are  oriented  horizontally 
or  vertically  on  the  one  hand,  or  at  45  degrees  or  135 
degrees  to  the  horizontal  on  the  other  (see,  for  ex¬ 
ample,  Sellers  et  al.  (1986)). 

What  is  not  known  is  whether  this  oblique  effect 
persists  in  color  vision.  Experiments  thus  far  seem 
to  have  been  inconclusive.  These  experiments  are 
difficult  to  design  and  carry  out,  because  the  col¬ 
ored  stimuli  need  to  be  presented  at  equal  lumi¬ 
nance.  (Otherwise,  of  course,  the  change  in  sensi¬ 
tivity  might  be  equally  well  ascribed  to  changes  in 
colors  or  to  changes  in  luminance.)  Since  luminance 
is  a  fact  of  perception,  different  subjects  have  differ¬ 
ent  ideas  of  what  constitutes  equiluminant  stimuli. 

A  colleague  of  ours  is  gathering  data  to  try  to  de¬ 
termine  whether  the  chromatic  oblique  effect  exists. 
In  his  experiment  a  subject  looks  at  a  screen  and 
stimuli  are  projected  onto  it.  Each  stimulus  consists 
of  sets  of  parallel  lines,  oriented  either  at  0  degrees 
or  90  degrees  (“non-oblique”)  or  at  45  or  135  degrees 
(“oblique”).  The  parallel  lines  (which  are  presented 
at  different  thicknesses  in  different  experiments)  are 
alternately  red  and  green.  Ideally  these  lines  would 
be  presented  as  equally  luminant;  that  way,  the  sub¬ 
ject  would  need  to  use  color  vision  to  differentiate 
the  parallel  lines  and  to  detect  the  stimulus.  This  is 
difficult  to  do,  despite  a  certain  amount  of  subject- 
specific  calibration  that  takes  place  before  the  exper¬ 
iment  begins.  So  in  this  experiment  the  stimuli  are 
known  to  be  nearly  but  not  exactly  equiluminant. 


Figure  1:  Grid  Stimulus 
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The  background  on  which  the  parallel  lines  are 
projected  consists  of  a  constant  color  that  is  pro¬ 
duced  by  having  each  of  the  red,  green,  and  blue 
electron  guns  in  the  monitor  operate  at  half  power. 
The  stimuli  are  produced  across  the  range  of  power 
available  to  the  red  and  green  guns. 

These  power  readings  are  standardized  so  that 
half-power  is  set  to  0,  no  power  is  —1,  and  full  power 
is  +1.  Then  each  stimulus  can  be  plotted  on  a  graph 
where  the  horizontal  axis  is  the  red  stimulus  and 
the  vertical  is  the  green.  Any  ray  out  from  the  ori¬ 
gin  then  depicts  a  set  of  constant  red/green  con¬ 
trasts,  where  the  luminance  increases  with  distance 
from  the  origin  but  there  is  no  change  in  chromatic 
contrast.  (In  the  “negative”  directions,  the  removal 
of  red  or  green  power  increases  the  ability  of  the 
subject  to  see  the  stimulus  in  contrast  to  the  back¬ 
ground.)  Figure  1  shows  (in  monochrome)  examples 
of  stimuli  with  different  orientations  (top  and  bot¬ 
tom)  and  different  thicknesses  or  spatial  frequencies 
(left  to  right).  Incidentally,  the  ability  easily  to  pro¬ 
vide  high-quality  color  graphics  is  another  advantage 
of  a  network-based  rather  than  paper-based  journal 
article. 

The  subject  is  asked  to  indicate  whether  the  stim¬ 
ulus  is  visible  or  not.  At  the  origin,  the  stimulus  is 
never  visible  because  its  luminance  is  zero  and  it 
does  not  contrast  with  the  background.  Far  from 
the  origin,  the  luminance  is  large,  and  the  stimulus 
is  always  visible.  The  object  of  the  experiment  is  to 
determine  the  thresholds:  the  critical  values  of  the 
luminance,  at  which  the  stimulus  is  visible  but  below 
which  it  cannot  be  seen.  Vision  theory  suggests  that 
the  set  of  these  thresholds,  plotted  in  this  red-green 
space,  should  form  an  ellipse  centered  at  the  origin. 
The  statistical  problem  here,  first,  is  to  fit  an  ellipse 
to  this  set  of  thresholds,  and  to  be  able  to  construct 
confidence  intervals  and  test  hypotheses  about  the 
parameters  of  the  ellipse,  and,  second,  to  make  this 
information  available  to  non-statistical  users. 

3.2  Fitting  Ellipses 

Fitting  ellipses  to  noisy  data  is  a  problem  that  has 
been  considered  for  some  time.  This  problem  arises 
most  notably  in  the  pattern  recognition  world,  where 
the  object  is  to  recognize  a  circle  that  is  presented 
to  the  viewer  at  an  angle.  Among  the  early  works  on 
this  problem  is  Bookstein  (1979);  the  recent  article 
by  Rosin  (1993)  on  d^ta  normalization  includes  a 
number  of  useful  references. 

Previous  work  on  this  problem  has  either  not  as¬ 
sumed  a  specific  error  structure  or  has  chosen  one  in¬ 
compatible  with  the  current  example.  In  the  current 


problem,  the  errors  in  the  observations  are  clearly 
radial,  since  each  set  of  stimuli  make  up  a  ray  out 
from  the  origin  of  constant  chromatic  contrast.  (In 
contrast,  common  error  models  assume  either  that 
the  X  and  y  co-ordinates  of  the  data  are  subject  to 
independent  errors,  or  that  the  error  is  perpendicu¬ 
lar  to  the  ellipse  at  that  point,  which  is  not  the  same 
as  being  radial.) 

A  second  consideration  is  that  previous  investiga¬ 
tors  have  mostly  considered  the  general  conic  section 
equation 

Axy  -f  Bx^  +  Cy'^  -h  -h  £*2/  +  F  =  0 

in  their  model  of  the  ellipse.  Our  interest  lies  more  in 
the  familiar  parameterization  where  an  ellipse  par¬ 
allel  to  one  of  the  co-ordinate  axes  satisfies 

{x  -  Xo)/a'^  +  {y-  yo)/b^  = 

because  it  is  the  semi-aoces  a  and  b  about  which  we 
would  like  draw  inferences. 

Let  us  write  our  data,  yi},  i  =  1, . . . ,  n  in  po¬ 
lar  co-ordinates  as  {ri,ti},2  =  Let  xo,yo 

be  the  ellipse’s  center  (this  is  often  assumed  to  be 
the  origin,  but  that  assertion  is  something  to  be 
tested),  and  use  9  to  designate  the  angle  the  ellipse 
makes  with  the  positive  ar-axis.  (Figure  2  shows  an 
ellipse  with  its  parameters  in  our  parametrization  in¬ 
dicated:  a  and  b  are  the  semi-axes,  0  is  the  angle  of 
inclination,  and  (xq,  yo)  are  the  co-ordinates  of  the 
center.)  As  usual,  a  “hat”  designates  a  fitted  value. 
Then  our  model  can  be  simply  stated  as 

n  =:  ri  i.i.d.  N(0,(7^). 

Suppressed  in  this  notation  is  the  dependence  of  the 
fitted  ri  on  the  underlying  parameters  a,  6, 0,  xq,  yo- 
It  can  be  argued  that  the  errors  can  be  expected  to 
be  larger  where  the  ellipse  is  wider;  perhaps  a  more 
appropriate  model  would  assume  a  constant  coeffi¬ 
cient  of  variation  for  the  c.  For  the  current  purpose 
the  assumptions  of  Normality  and  equal  variance  will 
simply  be  left  as  assumptions. 

As  in  ordinary  linear  regression,  the  maximum- 
likelihood  solution  under  this  model  coincides  with 
the  least-squares  solution.  However,  the  model  is 
non-linear  in  the  parameters,  so  some  iterative  tech¬ 
nique  is  required  to  find  the  best-fitting  set  of  param¬ 
eters.  For  this  purpose  we  constructed  a  non-linear 
regression  routine  in  Java  so  that  these  ellipses  can 
be  fit  by  both  our  local  colleague  and  by  other  re¬ 
searchers  in  other  facilities  with  different  machines 
and  software.  (Unhappily,  our  system  uses  the  latest 
version  of  Java,  version  1.1,2,  which  is  not  yet  sui>- 
ported  by  most  browsers.  However  an  application 
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Figure  2:  Geometry  of  Ellipse 

Five- Parameter  Ellipse 


version  —  a  full  executable  program,  not  an  applet 
that  runs  inside  a  browser  —  is  available  from  the 
authors  by  e-mail  for  those  users  who  have  installed 
the  Java  Run-Time  Environment.) 

4*  Results 

It  is  not  possible  to  show  in  this  journal  article  the 
full  results  of  our  labor:  this  is  part  of  our  point, 
that  this  labor  has  produced  an  algorithm  that  can 
be  used  instantly  and  need  not  be  re-created  by  in¬ 
terested  users.  However  a  couple  of  results  are  in¬ 
teresting.  After  having  written  the  code  to  find  the 
estimates  of  the  ellipse  parameters,  it  remains  to  find 
their  standard  errors.  These,  of  course,  will  be  re¬ 
quired  to  make  any  sort  of  statement  about  whether 
two  ellipses  (an  oblique  and  a  non-oblique  one,  for 
example)  are  “significantly  different.” 

One  usual  approach  to  this  problem  would  be  to 
rely  on  some  asymptotic  result.  It  is  well  known 
that  asymptotically  (under  some  regularity  condi¬ 
tions)  the  vector  of  maximum-likelihood  estimates 
is  multivariate  Normal  with  expected  value  given  by 


the  true  values  of  the  parameters.  Furthermore,  the 
covariance  matrix  of  the  estimates  is  given  by  the 
inverse  of  the  Fisher  information  matrix  (Lehmann 
(1983)).  The  Hessian  —  the  matrix  of  second  deriva¬ 
tives  of  the  log-likelihood  —  is  often  used  to  approxi¬ 
mate  the  Fisher  information,  and  this  matrix  is  easy 
to  obtain.  Our  experience  shows,  however,  that  this 
small-sample  approximation  is  not  an  accurate  one 
for  the  sample  sizes  in  our  problem  (generally  fewer 
than  80  observations). 

A  second  solution,  and  perhaps  the  most  attrac¬ 
tive,  would  come  from  simulation,  either  starting 
with  the  estimates  and  generating  a  set  of  errors 
from  the  postulated  error  distribution  or,  in  larger 
samples,  estimating  the  standard  errors  by  the  boot¬ 
strap.  The  simulation  solution  is  costly  from  a  com¬ 
putational  standpoint,  but  under  the  Internet  model 
this  cost  is  borne  by  the  users  of  the  software,  not  by 
the  developers,  since  it  runs  on  the  local  machine. 

A  third  choice,  and  one  that  our  research  has  so 
far  shown  to  be  reasonable,  is  to  rely  on  the  well- 
known  result  that  —2  times  the  log  of  a  likelihood  ra¬ 
tio  has,  asymptotically,  the  chi-squared  distribution. 
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The  software  uses  this  result  to  examine  the  hypoth¬ 
esis  that  additional  parameters  are  not  needed  in 
the  model,  and  the  associated  confidence  intervals 
consist  of  all  points  in  parameter  space  for  which 
this  test  does  not  reject  that  hypothesis.  So  for  ex¬ 
ample  we  can  examine  the  null  hypothesis  that  the 
true  center  of  a  particular  ellipse  is  in  fact  at  the 
origin  by  fitting  a  five-parameter  ellipse  and  com¬ 
paring  that  likelihood  to  the  likelihood  under  the 
three-parameter  model  in  which  the  center  is  fixed 
at  (0,0).  Under  the  null  hypothesis,  —2  times  that 
likelihood  ratio  has  the  X2  distribution.  Our  soft¬ 
ware  will  compute  this  statistic  and  report  it  and 
its  associated  p-value.  In  fact  it  will  also  draw  a 
confidence  contour  on  the  plot  of  the  thresholds  so 
that  the  investigator  can  see  clearly  the  extent  to 
which  the  ellipse  is  off  center.  (This  calculation  has 
led  to  the  conclusion  that  one  monitor  used  in  the 
experiment  has  not  been  properly  calibrated.) 

The  hypothesis  that  two  ellipses  (one  based  on 
oblique  stimuli,  the  other  on  non-oblique  stimuli)  are 
not  significantly  different  can  be  tested  by  compar¬ 
ing  the  model  that  consists  of  two  separate  ellipses 
(and  which  therefore  totals  ten  parameters)  to  the 
restricted  model  in  which  all  the  data  is  presumed 
to  come  from  a  single  ellipse  (a  model  with  five  pa¬ 
rameters).  A  nested  F-test  similar  to  the  usual  lin¬ 
ear  regression  F-test  produces  a  statistic  that  is  a 
good  approximation  to  the  F  (Seber  and  C.  Nelder 
(1989)).  As  with  the  approximation,  our  simula¬ 
tion  results  have  so  far  borne  out  this  F  approxima¬ 
tion  as  believable. 

Another  way  to  compare  data  from  oblique  and 
non-oblique  stimuli  is  to  specify  that  the  two  ellipses 
differ  only  in  a  single  parameter:  that  oblique  stim¬ 
uli  and  non-oblique  stimuli  produce  ellipses  with  the 
same  values  of  6, 0,  xq  and  j/o  but  that  the  two  values 
of  a  differ  (this  is  the  oblique  effect  in  action).  Again 
an  F-test  can  test  this  hypothesis. 

5*  Conclusions 

Statisticians  can  now  construct  and  distribute  over 
the  Internet  dynamic,  interactive  electronic  research 
papers  that  include  embedded  computer  programs 
that  can  execute  immediately  on  the  reader’s  com¬ 
puter.  This  can  allow  the  reader  to  reproduce  the 
author’s  results  and  to  immediately  apply  the  em¬ 
bedded  algorithms  to  his  or  her  own  data.  The  Java 
program  for  fitting  ellipses  to  vision  data  illustrates 
how  a  sophisticated  statistical  analysis  can  be  made 
available  instantly  to  vision  researchers  around  the 
world.  Readers  need  not  know  how  to  port  or  com¬ 
pile  code;  the  research  paper  not  only  describes  the 


steps  needed  to  perform  the  computations  but  ac¬ 
tually  performs  them  as  well.  This  ability  to  in¬ 
clude  algorithms  in  papers  brings  an  immediacy  to 
the  author-reader  interaction  that  has  not  been  pos¬ 
sibly  previously.  There  is  no  limitation  on  the  type 
of  computer  or  software  that  a  reader  needs  (other 
than  that  it  be  equipped  with  one  of  the  popular 
Web  browsers).  This  approach  allows  modifications 
of  the  code  that  the  author  makes  to  be  propagated 
instantly  to  any  users  simply  by  having  the  code 
updated  at  its  home  location.  Users  are  guaran¬ 
teed  complete  security  for  their  computer  environ¬ 
ments.  Finally,  the  speed  of  execution  is,  while  not 
the  fastest  that  can  be  achieved,  certainly  accept¬ 
able  for  this  application.  The  ability  to  write  this 
kind  of  research  paper  can  and  should  change  the 
way  statistical  research  is  performed, 
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